Multi-armed bandit

Results: 113



#Item
101. Mathematical optimization / Machine learning / Multi-armed bandit / Stochastic optimization / Optimal design / Dynamic programming / Statistics / Operations research / Systems engineering

Keeping Your Options Open Forand, Jean Guillaume Working Paper No. 557 October [removed] UNIVERSITY OF

Source URL: rcer.econ.rochester.edu

Language: English - Date: 2011-09-10 11:03:12
102. Game artificial intelligence / Search algorithms / Stochastic optimization / Monte Carlo method / Probabilistic complexity theory / Minimax / Microsoft Certified Professional / Simulation / Multi-armed bandit / Mathematics / Applied mathematics / Statistics

Information Sharing Current Concepts Information Sharing in MCTS UEC 7th Symposium 2013

Source URL: pasky.or.cz

Language: English - Date: 2013-03-19 08:56:45
103. Stochastic optimization / Decision theory / Markov decision process / Gittins index / Statistics / Machine learning / Multi-armed bandit

Multi-armed Bandit Problems with Dependent Arms Sandeep Pandey

Source URL: www.cs.cmu.edu

Language: English - Date: 2007-06-18 15:15:12
104. Statistical inference / Shrinkage estimator / Least squares / Machine learning / Estimator / Reinforcement learning / Mean squared error / Sampling / Multi-armed bandit / Statistics / Estimation theory / Statistical theory

Bandits for Taxonomies: A Model-based Approach Sandeep Pandey

Source URL: www.cs.cmu.edu

Language: English - Date: 2007-01-18 12:51:31
105. Stochastic optimization / Reinforcement learning / Algorithm / Bandit / Statistics / Machine learning / Multi-armed bandit

Mortal Multi-Armed Bandits Ravi Kumar Yahoo! Research Sunnyvale, CA [removed]

Source URL: www.cs.cmu.edu

Language: English - Date: 2008-10-21 03:12:22
106. Information retrieval / Machine learning / Human–computer interaction / Recommender system / Multi-armed bandit / Algorithm / Personalization / Greedy algorithm / Statistics / Mathematics / Information science

Hybrid-ε-greedy for Mobile Context-aware Recommender System Djallel Bouneffouf, Amel Bouzeghoub & Alda Lopes Gançarski

Source URL: www-inf.int-evry.fr

Language: English - Date: 2013-03-11 08:37:28
107. Machine learning / Reinforcement learning / Bayesian inference / Q-learning / Bayesian network / Kullback–Leibler divergence / Multi-armed bandit / Prior probability / Supervised learning / Statistics / Bayesian statistics / Statistical theory

PDF Document

Source URL: www.aaai.org

Language: English - Date: 2006-01-10 20:47:29
108. Reinforcement learning / Q-learning / Multi-armed bandit / Statistics / SARSA / Normal distribution

PDF Document

Source URL: www.tokic.com

Language: English - Date: 2011-12-02 21:34:52
109. Multi-armed bandit / Stochastic optimization / Reinforcement learning / SARSA / Normal distribution / Temporal difference learning / Q-learning / Statistics / Computational neuroscience / Machine learning

PDF Document

Source URL: www.tokic.com

Language: English - Date: 2011-12-02 21:34:52
110. Stochastic optimization / Markov models / Artificial intelligence / Dynamic programming / Bandit / Markov decision process / Stochastic matrix / Game theory / Stochastic / Statistics / Machine learning / Multi-armed bandit

PDF Document

Source URL: homes.di.unimi.it

Language: English - Date: 2012-12-13 05:36:01